Genome-Wide Identification and Evolutionary Analysis of the Animal Specific ETS Transcription Factor Family
نویسندگان
چکیده
The ETS proteins are a family of transcription factors (TFs) that regulate a variety of biological processes. We made genome-wide analyses to explore the classification of the ETS gene family. We identified 207 ETS genes which encode 321 ETS TFs from ten animal species. Of the 321 ETS TFs, 155 contain only an ETS domain, about 50% contain a ETS_PEA3_N or a SAM_PNT domain in addition to an ETS domain, the rest (only four) contain a second ETS domain or a second ETS_PEA3_N domain or an another domain (AT_hook or DNA_pol_B). A Neighbor-Joining phylogenetic tree was constructed using the amino acid sequences of the ETS domain of the ETS TFs. The results revealed that the ETS genes of the ten species can be divided into two distinct groups. Group I contains one nematode ETS gene and 18 vertebrate animal ETS genes. Group II contains the majority of the ETS TFs and can be further divided into eleven subgroups. The sequence motifs outside the DNA-binding domain and the conservation of the exon-intron structural patterns of the ETS TFs in human, cattle, and chicken further support the phylogenetic classification among these ETS TFs. Extensive duplication of the ETS genes was found in the genome of each species. The duplicated ETS genes account for ~69% of the total of ETS genes. Furthermore, we also found there are ETS gene clusters in all of the ten animal species. Statistical analysis of the Gene Ontology annotations of the ETS genes showed that the ETS proteins tend to be related to RNA biosynthetic process, biopolymer metabolic process and macromolecule metabolic process expected from the common GO categories of transcriptional factors. We also discussed the functional conservation and diversification of ETS TFs.
منابع مشابه
In Silico Genome-Wide Screening for TnrA-Regulated Genes of Bacillus clausii
Bacillus clausii TnrA transcription factor is required for global nitrogen regulation. In order to obtain anoverview of gene regulation by TnrA in B. clausii KSMK16, the entire genome of B. clausii was screened forthe consensus sequence, 5’-TGTNAN7TNACA-3’ known as the TnrA box, and 13 transcription units werefound containing a putative TnrA box. The TnrA targets identified in...
متن کاملGene Family: Structure, Organization and Evolution
Gene families are considered as groups of homologous genes which they share very similar sequences and they may have identical functions. Members of gene families may be found in tandem repeats or interspersed through the genome. These sequences are copies of the ancestral genes which have underwent changes. The multiple copies of each gene in a family were constructed based on gene duplicati...
متن کاملDNA Specificity Determinants Associate with Distinct Transcription Factor Functions
To elucidate how genomic sequences build transcriptional control networks, we need to understand the connection between DNA sequence and transcription factor binding and function. Binding predictions based solely on consensus predictions are limited, because a single factor can use degenerate sequence motifs and because related transcription factors often prefer identical sequences. The ETS fam...
متن کاملBioinformatics Genome-Wide Characterization of the WRKY Gene Family in Sorghum bicolor
The WRKY gene family encodes a large group of transcription factors that regulate genes involved in plant response to biotic and abiotic stresses. Sorghum is a notable grain and forage crop in semi-arid regions because of its unusual tolerance against hot and dry environments. We identified a set of 85 WRKY genes in the S. bicolor genome and classified them into three groups (I–III). Among the ...
متن کاملETS1 is a genome-wide effector of RAS/ERK signaling in epithelial cells
The RAS/ERK pathway is commonly activated in carcinomas and promotes oncogenesis by altering transcriptional programs. However, the array of cis-regulatory elements and trans-acting factors that mediate these transcriptional changes is still unclear. Our genome-wide analysis determined that a sequence consisting of neighboring ETS and AP-1 transcription factor binding sites is enriched near cel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 5 شماره
صفحات -
تاریخ انتشار 2009